On Improving Dependency Implication Algorithms

Authors

  • Wai Yin Mok
  • David W. Embley
Abstract

Let Σ be a finite set of join dependencies (JDs), multivalued dependencies (MVDs) and functional dependencies (FDs). To test whether Σ implies an MVD X →→ Y or an FD X → A, the procedure in [3] first generates the dependency basis DEP(X) of X with respect to Σ. Then, X →→ Y if and only if Y is the union of some elements in DEP(X), and X → A if and only if A ∈ DEP(X) and there is an FD V → W in Σ with A ∈ W − V. In this paper, we present two algorithms, Algorithms 1 and 2, that test the same implication problems. However, these algorithms do not generate DEP(X) with respect to Σ, but rather directly determine whether X →→ Y or X → A without computing the basis for all right-hand sides Z for X → Z or X →→ Z as in [3]. As part of proving these algorithms correct, we also prove a result that is interesting in its own right. We organize this paper as follows. In Section 2, we present the basic definitions and some results in the literature that we need. In Section 3, we prove a theo-
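
The dependency-basis test that the abstract attributes to [3] can be illustrated with a minimal sketch. It covers only the simplified case where the dependency set contains MVDs alone (no FDs or JDs), and it uses the classical block-refinement procedure for computing DEP(X). It is not the paper's Algorithms 1 or 2, whose details are not given here. The names `dependency_basis` and `implies_mvd` and the tuple encoding of an MVD are assumptions made for illustration.

```python
from typing import FrozenSet, Iterable, List, Set, Tuple

Attrs = FrozenSet[str]
MVD = Tuple[Attrs, Attrs]  # (V, W) stands for the MVD V ->> W


def dependency_basis(universe: Iterable[str], x: Iterable[str],
                     mvds: List[MVD]) -> Set[Attrs]:
    """Compute DEP(X) for an MVD-only dependency set (illustrative assumption).

    Start from the single block U - X and split a block B whenever some
    V ->> W has V disjoint from B and W cuts B into two non-empty parts.
    """
    x = frozenset(x)
    blocks: Set[Attrs] = {frozenset(universe) - x}
    blocks.discard(frozenset())
    changed = True
    while changed:
        changed = False
        for v, w in mvds:
            for block in list(blocks):
                inside, outside = block & w, block - w
                if not (block & v) and inside and outside:
                    blocks.remove(block)
                    blocks.update({inside, outside})
                    changed = True
    # Each attribute of X forms its own singleton block of the basis.
    return blocks | {frozenset({a}) for a in x}


def implies_mvd(universe: Iterable[str], x: Iterable[str],
                y: Iterable[str], mvds: List[MVD]) -> bool:
    """X ->> Y holds iff Y is a union of blocks of DEP(X)."""
    y = frozenset(y)
    basis = dependency_basis(universe, x, mvds)
    # Y is a union of blocks iff every block lies inside Y or misses it.
    return all(block <= y or not (block & y) for block in basis)


U = {"A", "B", "C", "D"}
sigma = [(frozenset("A"), frozenset("B"))]  # A ->> B
print(implies_mvd(U, "A", "B", sigma))   # True
print(implies_mvd(U, "A", "CD", sigma))  # True (complementation)
print(implies_mvd(U, "A", "C", sigma))   # False
```

The sketch always builds the whole basis before answering a single query, which is the cost the abstract says Algorithms 1 and 2 avoid.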

Similar resources

Generalized Baum-Welch and Viterbi Algorithms Based on the Direct Dependency among Observations

The parameters of a Hidden Markov Model (HMM) are transition and emission probabilities. Both can be estimated using the Baum-Welch algorithm. The process of discovering the sequence of hidden states, given the sequence of observations, is performed by the Viterbi algorithm. In both Baum-Welch and Viterbi algorithms, it is assumed that...


Extending dependencies for improving data quality

This doctoral thesis presents the results of my work on extending dependencies for improving data quality, both in a centralized environment with a single database and in a data exchange and integration environment with multiple databases. The first part of the thesis proposes five classes of data dependencies, referred to as CINDs, eCFDs, CFDcs, CFDps and CINDps, to capture data inconsistencie...


The Implication of Probabilistic Conditional Independence and Embedded Multivalued Dependency

It has been suggested that Bayesian networks and relational databases are different because the implication problems for probabilistic conditional independence and embedded multivalued dependency do not always coincide. The present study indicates that the implication problems coincide on solvable classes of dependencies and differ on unsolvable classes. We therefore maintain that Bayesian netw...


The Rank Minrelation Coefficient

Abstract: Bivariate (or pairwise) information measures such as mutual information or correlation are heavily used in variable selection and network inference algorithms mainly because they are faster and require fewer samples than multivariate (or multidimensional) strategies. This paper proposes a new relevance measure that aims at improving the detection of relevant variables based on pairwis...


Efficient Algorithms on the Moore Family Associated to an Implicational System

An implication system (IS) Σ on a finite set S is a set of rules called Σ-implications of the kind A→Σ B, with A,B⊆ S. A subset X ⊆ S satisfies A→Σ B when “A⊆ X implies B⊆ X” holds, so ISs can be used to describe constraints on sets of elements, such as dependency or causality. ISs are formally closely linked to the well known notions of closure operators and Moore families. This paper focuses ...


Journal:
  • Inf. Process. Lett.

Volume 64  Issue

Pages  -

Publication date: 1997